Incremental Learning for Interactive E-Mail Filtering

نویسندگان

  • Ding-Yi Chen
  • Xue Li
  • Zhao Yang Dong
  • Xia Chen
چکیده

AbstrAct In this paper, we propose a framework namely, Prediction-Learning-Distillation (PLD) for interactive document classification and distilling the misclassified documents. Whenever a user points out misclas-sified documents, the PLD learns from the mistakes and identifies the same mistakes from all other classified documents. The PLD then enforces this learning for future classifications. If the classifier fails to accept relevant documents or reject irrelevant documents on certain categories, then PLD will assign those documents as new positive/negative training instances. The classifier can then strengthen its weakness by learning from these new training instances. Our experiments results have demonstrated that the proposed algorithm can learn from user identified misclassified documents, and then distil the rest successfully.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A standard Interactive Multimedia eBook Generator Engine for e-Learning Process

Introduction: Using standard authoring tools is essential to promote E-Learning in teaching-learning process. Learning content in medical sciences often consists of multimedia elements. On the other hand, it is frequently required to revise and update the medical content. Hence, access to the authoring tools that can encompass multimedia elements and allow easy content revision is helpful in e-...

متن کامل

Incremental learning based on non-incremental in- duction algorithm

The machine learning algorithms can be divided into two general types: non-incremental that processes all training examples at once and incremental that handles examples one by one. This paper describes the multi-layer incremental inference algorithm (MLII) [1] based on the non-incremental inductive inference algorithm CN2 [2]. In original, the MLII algorithm used linked with the non-incrementa...

متن کامل

ifile: An Application of Machine Learning to E-Mail Filtering

The rise of the World Wide Web and the ever-increasing amounts of machine-readable text has caused text classification to become a important aspect of machine learning. One specific application that has the potential to affect almost every user of the Internet is e-mail filtering. The WorldTalk Corporation estimates that over 60 million business people use e-mail [6]. Many more use e-mail purel...

متن کامل

Integrating Interactive Whiteboards in EFL Learners' Learning and Retention of Non-congruent Collocations

Drawing on the assumptions of socio-cognitive linguistics, focusing on the effective role of interaction in terms of reducing the cognitive burden in the process of learning, this quasi-experimental study aimed at investigating the effect of the Interactive Whiteboard (IWB) usage on the learning and retention of non-congruent collocations among 60 homogenized Iranian EFL learners, aged 18 to 24...

متن کامل

Mitigating E-Mail Threats - A Web Content Based Application

The World Wide Web is a very powerful and interactive medium and its surveillance is unavoidable for information dissemination. Extracting valuable information from the vast unstructured data is a challenging and critical issue. Web content mining plays an important role in solving these issues. The applications of WWW are widespread and one among it is E-Mail communication. Due to its simple a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJITWE

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2006